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Abstract 

In this paper, a finite-state mean-reverting model for tlie short-rate, based on the 
continuous time Ehrenfest process, will be examined. Two explicit pricing formulae for 
zero-coupon bonds will be derived in the general and the special symmetric cases. Its 
limiting relationship to the Vasicek model will be examined with some numerical results. 
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1 Introduction 

One of the fundamental approaches to term structure modelling is based on the specification 
of the short-term interest rate - the short-rate. Vasicek [20j first introduced a mean-reverting 
short-rate model with Gaussian distribution and derived a closed-form representation for the 
zero-coupon bond (ZCB) price. Since then, a variety of short-rate models have become 
established, each having its advantages and disadvantages. 

Albeit the earliest, Vasicek's model is still very popular among practitioners owing to its 
analytical tractability with regard to ZCB prices and European options thereof. Unfortu- 
nately, the model has some shortcomings. The most prominent of these is the possibility for 
the interest rates to become negative - a fact relating to all models with Gaussian distribu- 
tion. Even though the probability of negative rates is rather small, not only is the realism of 
the model questionable, but also problems may appear while valuing ZCBs with a long time 
to maturity and a low interest rate level. 

The idea of using both the discrete and the continuous time versions of the Ehrenfest 
process in finance is well-known. The discrete time approach was used, for example , by 
Okunev and Tippett [14| in modelling accumulated cashflows, by Takahashi J9] in exploring 
changes in stock prices and exchange rates for currencies, and by Buehlmann [5] in modelling 
interest rates. With regard to the modelling of interest rates, it seems that the discrete time 
approach leads in general only to a recursively computable term structure. Sumita, Gotoh 
and Jin |il7j studied the passage times and the historical maximum of the Ornstein-Uhlenbeck 
process via an approximation by means of a special case of the continuous time Ehrenfest 
process. 



This paper proposes a finite-state mean-reverting model for tfie short-rate related to the 
continuous time Ehrenfest process. By choosing arbitrary lower and upper bounds for the 
rate, the respective short-rate process can be seen as a suitably linearly transformed birth- 
and-death process on {0, 1, . . . , A^}, N € N. By choosing the lower bound as non-negative, 
the problem of negative interest rates can be avoided. Furthermore, the model allows an 
explicit evaluation of ZCB prices. In this way, the model aims at realism and analytical 
tract ability. 

The main outcome of the paper is the derivation of pricing formulae for ZCBs in the 
general and the special symmetric cases of the model. In both cases the arbitrage-free ZCB 
price at time t and maturity T is given as follows: 

P{t,T) = C ■ Pi{t,T)^ ■ Po{t,Tf~>', 

where C is a constant, k G {0, 1, . . . N}, and Pi and Pq can be expressed in terms of the 
iFi hypergeometric functions of a matrix argument given in Section [2] (see also [7J). In 
the general case the model is governed by five parameters - a valuable fact considering the 
fitting of the model to the market data. The special case provides four parameters and 
is characterized by the symmetry of the underlying distribution with respect to the mean- 
reverting value. The advantage here is that we have more tractable expressions of Pi and Pq 
from the computational point of view. Moreover, a suitably transformed symmetric case of 
the model yields the Vasicek model in the limit as A'^ tends to infinity. 

The paper is organized as follows: the following section gives a short review of the special 
functions we shall encounter throughout this paper. Section three deals with the Ehrenfest 
process in continuous time. The main results are given in section four, where the Ehrenfest 
short-rate model is defined and the ZCB pricing formulae are derived. The fifth section gives 
an overview of the Vasicek model and its limiting relationship to the Ehrenfest short-rate 
model. Section six illustrates the advantages of the Ehrenfest short-rate model. 

This paper comprises part of my Ph.D. project at the Technische Universitat Dortmund. 

1 am deeply indebted to Professor Michael Voit for his patience and for being a wonderful 
research mentor. 

2 Special functions and orthogonal polynomials 

Throughout this paper we will make use of some well-known facts concerning the Krawtchouk 
polynomials (see [TO], [T8] and [21]) and iPi functions (see [7J), as well as some of their 
practical implications. In the interests of clarity we give in this section an overview of these 
special functions. 

Hypergeometric functions of matrix argument 

2.1 Definition. (a) A partition m := (mi, m2, . . . , m„) is an n-tuple (n G N) of non- 
negative integers such that mi > m2 > • • • > rUn- 

(b) For a partition m the generalized Pochhammer symbol is defined by 

n 

:= JJ(a-i + l)mj., 
where (a)^ := a{a + 1) • • • (a + A; — 1) denotes the usual Pochhammer symbol. 
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(c) For a partition m the normalized Schur function of index m is defined by 
Zm{z) := \m\l ■ 



Ui<j<k<nirnj -ruk-j + k) det(z™^^" ^) 



n"=i("ii +n-jy. Ui<t<j<nizi - Zj) ' 

(d) The hypergeometric function pFq of matrix argument is defined as a real-analytic 
function on the space £*„ (n G N) of n x n Hermitian matrices with eigenvalues 
z := {zi, . . . , Zn)^ e M", and is given by the series 



j=0 |m|=j 



where, for 1 < i < p and 1 < j < Oj G C and none of the numbers —hi + j — 1 is a 
non-negative integer. 

We will be particularly concerned with the iFi function, which is also known as the 
confluent hypergeometric function of matrix argument. From Theorem 4.1 in p'J we know 
that it converges absolutely for all z G Sn- An important result, that will be crucial later on, 
is given without proof in the following remark (see jTj, p. 25). 

2.2 Remark. Let A„ denote the standard simplex in M" (n G N), defined by 

An := |(xi,...,Xn) -.Xi >0,i= l,...n,E^i < l| • (2) 

Then for a > the following equation holds: 

a-l 



iFi{l; a + n; z) = {a)n J ( 1 " E J • exp f ^ ^i^ij dxi . . . dxr. 



(3) 



In order to compute the pFq function numerically we truncate the series ^ hy m<H 
as follows: 

,Ff(ai,...,a,;6i,...,6,;.):=X; E ' " " {°^|- • (4) 

i=o-^- H=i i^iJ™ • • • i^^J™ 

Koev and Edelman provide an effective algorithm for computing the pF^ function. For 
z £ Sn the complexity of their algorithm is linear in n and subexponential in H, which is 
acceptable if we consider the fast convergence of the power series Q. 



Kravi^tchouk polynomials For given N £ N and 0<p=l — g<l the Krawtchouk 
polynomials Ki{x) := Ki{x; N;p) are the orthogonal polynomials that relate to the binomial 
distribution B]\f^p and the probability mass function uj{x) := [^)p^q^~^ at the points x = 
0,1, . . . , N. They can be defined in two different, but equivalent ways. 

2.3 Definition. For x, / G {0, 1, . . . , iV} we set 

Ux) := ,M-l, -X- -N; 1/p) := E ^'^f^ (^) ^ , (5) 
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or 



Ki{x) 



-1 



k=0 



N — x\ fx 



I — k J \k J \p 



where 2-^1 is the classical Gauss hypergeometric function (see [18], §4.21). 

Definition |2.3| leads to the following basic well-known properties: 
2.4 Lemma. (a) Symmetry: 

Ki{x) = K^{l) 

for allxj G {0, 1, . . . , A^}. 
(6) Ko{x) = Ki{0) = 1 for all X, I £ {0,1,..., N}. 

(c) i^i(x) = l-^. 

(d) Generating function: 



p 



N 



k=0 



for all x£{0,l,.. .,N}. 
(e) Recurrence relation: 

- xKiix) = {N- l)vKi^i{x) - [{N - l)p + Iq] Ki{x) + lqKi_,{x) 

for allx,l e {0,1,..., A^}. 
(/) Orthogonality relation: 



N 



'^Ki{x)Km{x)uj{a 



where 



x=0 



N\ fpV 



uj{l)q 



-N 



for alll,mG {0, 1, . . . , A^}. 
(g) Forl,m£{0,l,...,N}, m<l 



N 



Bm,l ■= '^xKi{x)Km{x)u}{x) = < 



x=0 





-Iqiri^i 



if m < I — 2, 
if m = I — 1, 



^ iiN-l)p + lq)TTi if m = l. 



Note that Bim = B„ 



(6) 



(7) 



(8) 



(9) 



(10) 



(11) 



(12) 



Proof, (a) — (e) follow from ([s]), (/) follows from ^ and (g) is a direct application of ^ 
and ^ (see e.g. fTE\, §2.82). □ 
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3 Original Ehrenfest model 



The original Ehrenfest model describes the heat exchange between two isolated bodies, each 
of arbitrary temperature. The temperatures are symbolized by the number of fluctuating 
balls in two urns with a total oi N gN balls. For details of the continuous and discrete time 
versions of the model we refer to [2], [1^, [12] and [T6] . 

In this section we shall discuss the continuous time Ehrenfest process. Primarily, its 
representation as a sum of independent simple processes will allow us to show the main result 
of this paper which comes up in the next section. Furthermore, we examine the transition 
semigroup of the Ehrenfest process and explore some of its basic properties. 



Ehrenfest process Let N balls, initially distributed between urns I and II, fluctuate in- 
dependently in continuous time between the two urns. We fix a fluctuation parameter A > 
and independent Poisson processes {N^)t>o, . . . , {Nf^)t>o with intensity A. Let (yn)n6N be a 
Markov chain with the state space {0,1} and transition probability matrix 

^:=f'"/ .\)- (13) 



P 1-/3 

Then, the subordinated Markov chain {Y^ := Y]^t)t>o describes the state of the l-th ball at 
time t, where 1^' = 1 or when the l-th ball is in urn I or II respectively. Hence, 

N 

Xf-=TYl)^_ (14) 



1=1 



is a Markov process with the state space E := {0, 1, . . . , A^}, denoting the number of balls 
in urn I at time t. We call {Xt)t>o the (continuous time) Ehrenfest process. A discrete time 



version of (14) with arbitrary a and (3 was studied by Kraft and Schaefer |12j . 



3.1 Remark, (a) A special case of (14) with a = (3 = 1, first suggested by Siegert [T6] and 
also studied by Bingham |2j , where the transitions become "deterministic" in the sense 
of switching between the states and 1, will be important for us later on in Section |4j 
(Its discrete time analogue leads to the original Ehrenfest chain). 



(b) Karlin and McGregor [10] provided an alternative but equivalent definition of (14) as a 
birth- and- death process with the state space E. Here, the time intervals between events 
are independently exponentially distributed with intensity 7, and for i € E the birth 
and death rates are Aj := 70 ^^^^ and //j = 7/?]^ respectively, where a and f3 are given 
as above. It can be verified that in the setting at hand 7 = AA^. 

Before computing the transition semigroup of {Xt)t>o we need the following result: 

3.2 Lemma. The transition semigroup of {Y^)t>o is given by 

~ yq- qe^Mc+i3)t ^ _^ ^^-x{a+f))t J > K^^) 
where p := and q := 1 — p. 
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Proof. Since {Yf)t>o is a Markov chain subordinated by a Poisson process with index A, the 
associated transition semigroup can be written as (see [3j, p. 333) 



oo 



We can avoid the computation of by writing P = fiS + (1 — /i)/, where S = \ ^ ) is a 



stochastic matrix with S'^ = S, I is the identity matrix, and fi := a + (3. Then, (16) becomes 

-A(a+gHf^ (A(a + m" 



oo 

n! 



n! 
n=0 



Since 5" = 5" for all n G N, we can easily compute the above series, which completes the 
proof. □ 



3.3 Remark. Analogue to the proof above, we see that a variation of a and /3 in (15) can 
be equivalently described as a suitable modification of p and A. Thus, the distribution of 
(^)t>o depends only on the two parameters p and A or, equivalently, a and /3. 

3.4 Theorem. (Properties of the Ehrenfest process) Let {Xt)t>o be the Ehrenfest process 



given by (14)- 



(a) The transition probabilities Pij{t) := ¥'{XtJ^s = j\Xs = i) are given by 

where p := ^j^, q := I — p, and Ki{.) := Ki{.;p;N) {i G E) are the Krawtchouk 
polynomials as given in Definition\2. 



(b) The conditional mean and variance of {Xt)t>o are given by 

E[Xt\Xo = i] = Np- {Np - i)e"^("+^)*, (18) 
Yar[Xt\Xo = i] = iVp(l - p) + (A^p - i)(2p - l)e-^("+'')* (19) 
-(A^p-i)2(2p-l)e-2^("+/3)*. 

(c) The stationary distribution is the limiting distribution, and it is given by the binomial 
distribution B^^p on E with parameter p. 

Proof. Ad (a) : The proof here is similar to that in [2]. First, we compute the moment- 
generating function of (y/)t>o, given Y^. In the following we suppress the dependence of 



3.2 



we have 



{Yl)t>Q on I when it is clear from the context. From Lemma 

E[z^' 1^0 = 1] = Pi^{t)+pii{t)z = q+pz-q{l-z)e-^^''+^'^\ 
E[z^'|yo = 0] = pQ^{t)+p^i{t)z = q+pz + p{l-z)e-''^''+f'^K 
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Since {Yl)t>o are independent for all / = 1, . . . , A^, we obtain from ( 14) the moment-generating 
function of {Xt)t>o-, given = i, as follows: 



N 

j=0 



E 



= « 



E 



N 



= ^ 



.1=1 



E [z^'\Yo = 1] -E [z^*|yo = 



,N- 



q + pz — q{l — z)e 



-\(a+l3)t 



q + pz + p{l — z)e 



-\{a+l3)t 



N~i 



(q+pz) 



N 



1 



p q + pz 



Applying (a) and (d) of Lemma 2.4, we obtain 

N N 



2,-0 



x=0 

Applying Q once again, we get 

N N 



q pz 
p q 



N-x 



-\{a+l3)xt 



j=0 x=0^^^ [j=0 ^ ^ ^ ^ 

^ (p^ 



-\{a+l3)xt 



E 



x=0 



X{a+(3)xt 



Equating coefficients of z^ completes the proof of the claim. 

Ad (6) : Combining (17) with results (a), (6) and (g) of Lemma 2.4 we obtain 

N N N 

E[Xt\Xo = i] = j;j-p.,(t) = Ej-a;(j)E7r,i^,(x)i^,(x)e-^("+«-* 

j=0 j=0 x=0 



N 



N 



X e 



-X{a+I3)xt 



x=0 
N 



y^^Tr.j:Bo^^Ki{x)e~ 



j=0 



X{a+I3)xt 



(20) 



x=0 
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where Bq^x 7^ for x G {0, 1} as given in (12), which implies (18). Moreover, 

N N N 

j=0 j=0 x=0 

N N 

x=0 j=0 

N N 

x=0 j=0 



+ [{N - x)p + xq] K^{j) - xqK^^iij) 



N 



y^^iTxKi{x)e 



-X{a+I3)xt 



x=0 



-{N - x)pBo,x+i 



+ [(A^ - x)p + xq] Bo^x - xqBo,x-i 



where i3o,x is given by (12). A straightforward computation of 

Yar[Xt\Xo = i] = E[Xf\Xo = i] - E[Xt\Xo = if 



leads immediately to ( [19^ . 
Ad (c) : Clear. 



□ 



4 Ehrenfest short-rate model 

In this section we introduce a finite-state mean-reverting short-rate model associated with 



the continuous time Ehrenfest process ( 14) and give its basic properties. As a main result, we 



exploit the algebraic-combinatorial roots of the Ehrenfest process and derive explicit pricing 
formulae for ZCBs in the general and the special cases of the process, both of which have 
their advantages. 

Definition and properties Let [rm,rM] C R be an interval on the real line. We decom- 
pose it into N equal pieces of length h := and consider the process 



(21) 



as a short-rate process with state space E ■= {rk := hk + rm, k = 0,...,N}, where (X^'^^] 



is the Ehrenfest process given by (14) with a,/3 G (0,1]. Considering Remark 3.1 (6), we 
notice that {R[^^)t>o can be seen as an affine linearly transformed birth-and-death process 
on {0, 1, • • • , A^}. In the case at hand, can be interpreted as the state space discretization 



t )t>0 



also depends on A''. We will suppress this dependence 



parameter. Clearly, (Rt := R 
when it is clear from the context. Bearing in mind Remark 3.3, we denote this short-rate 
model as £{p, A) model. 
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From Theorem 3.4 we immediately obtain the conditional mean and variance of {Rt)t>o, 
given Rq := rk £ E, as follows: 



E[Rt\Ro = hk + rm] 
Yar[Rt\Ro = hk + Vm] 



h-E[Xt\Xo = k]+rm 

{VM - Tra) (p - {p - + r^, (22) 

-YarlXtlXo = k] 

^''^ ~ (iVp(l -p) + {Np - i){2p - l)e-^("+^)* 

-(A^p-f)2(2p- l)e-2^("+^)*), (23) 



Ar2 



where p = We also obtain the mean reversion of {Rt)t>o 

pvM + (1 -p)rm, 



lim E[Rt] 

t—^oo 



hm Yar[Rt] 

t— >oo 



N 



p{l — p) < oo. 



(24) 
(25) 



Thus, we have a total of five parameters, r^, ^m^P, A and N, to fit the model to the market 



data. Here, p governs the skewness of the underlying distribution, ■ 



, N and p have an 



impact on its kurtosis, and A infiuences the speed of reversion to the mean reverting value 
prM + (1 -p)rm- 

Zero-coupon bond We assume that an equivalent martingale measure (or risk-neutral 
measure) exists (see [13], Prop. 4.2), and that the underlying probability measure P is this 
equivalent martingale measure. Let {Ft)t>o be the natural filtration of {Rt)t>o- Then, the 
arbitrage-free ZCB price at time t with a face value of 1 monetary unit and maturity at T is 
given by (see [l], p. 51) 



pW{t,T)=E 



exp 



ii.s ds 



Ft 



r £ E. 



(26) 



In the following we also omit explicitly writing out the dependence of P(t, T) on N when it 
is clear from the context. 



The calculation of (26) within the £{p,\) model with arbitrary a,/3 £ (0,1] is inspired 
by the proof of Theorem 3.1 in [6]. There, Delbaen and Shirakawa represent the transition 
probabilities of the underlying short-rate process as a weighted series of the Jacobi polyno- 
mials. Using orthogonality relations of the Jacobi polynomials, they obtain a pricing formula 
for ZCBs in the associated model. However, this formula is only semi-explicit, since it con- 
tains multiple integrals that have to be calculated iteratively. We will avoid this problem by 
representing such integrals in terms of iFi functions. 



4.1 Theorem (ZCB price in £{p,X) model). Let {Rt)t>o be given by the definition (21) with 



a, /? G (0, 1], p = and A > 0. The price at time t > of a ZCB with maturity at T, 
conditional on Rt = r £ E, is given by 



P{t, T) = e-^™(^-*) • Pi{t, Tf ■ Po{t, T) 



N-k 



(27) 
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where k := ^^-^ G {0, . . . , iV} and for m G {0, 1} 

X-h{T-t)r 



Pm{t,T) := l + ^(-ir 

n=l 



ri! 



(28) 



ii=0 i„=0 



• iFi l;n + l;z 



Here, Km is given according to (b) and (c) of Lemma 2.4 ^ iFi is defined by 
-A(a + P){T -t){ii,..., G M", io := 0, and 



^(n) 



Zj(p-l) i/ = - 1, 



\ij{l-2p)+p if ij^i=ij. 

Proof Let r^ = hk + rm, k G {0, . . . , N}, be the state of {Rt)t>o at time t. Bearing in mind 
that {Rt)t>o is a Markov process, and using the definitions (14) and (21), we get from (26) 

P{t,T) = E exp I - / I /i> yi + r™, 1 ds 



-r^(T-t) 



■E 



N 



J|exp 



.1=1 



T 



hY ds 



Xt = k 



Xt = k 



exp 



T 



-, k 



•E, 



exp 



7V-fc 



(29) 



where • ] := E[ . \Yt = m] for m G {0, 1}. The last equahty holds because of the inde- 

pendence of (Yj)t>Q for all / = 1, . . . , A^. In the following we omit writing out the dependence 
on particular I, and set 



Pm{t,T) :=E 



m,t 



exp 



/lY, (is 



, mG{0,l}. 



Using (30), we rewrite ([29j) as follows: 

p{t,T) = e-'-™(^-*) . Pi{t,Tf ■ Po{t,T)^-\ 

From the power series representation of the exponential function, we obtain 

rT 



Pmit,T) = E 



hY^ ds 



n=0 ^ 
^ f-T f-T r-l 

1 + ^(-l)"/i- / / . . . / E„,t[y,„ • • • dsn... ds2dsu 



(30) 



(31) 



(32) 



where t =: sq < si < ■ ■ ■ < Sn < T. The last equality follows from 



I Ysds\ =nl / . . . / Ys„---Ys^dsn-.. ds2dsi 



10 



and the dominated convergence theorem. 

For the given t < si < • • • < s„ < T and tj := Sj — Sj-i we have 

1 In 
mi=0 m„=0 j=l 

where mo := m. Using the symmetry relation Q, we write the transition probabihties 
(tj) given in Theorem as fohows: 



Pmj.i,m,{tj) = w{mj) ^ ■n-ijKmj_Aij)Kmj{ij)e 



where Aj^. := A(a + I3)ij. Analogue to the calculation of the expected value (20) in the proof 
of Theorem 13.41 we obtain 



i„=0 



where Bi^_-^^i^ is defined by (12) and iq := 0. Iteratively, we get 

1 1 



ii=0 in=0 



Hence, (32) becomes 



Pm{t,T) = l + ^(-l)-/j"^...^K^(i„) 



n=0 



n=o in=o 



i=i 



(33) 



/ ■■■ exp -y^Ai^(Sfc -Sfc-l) dSn - . -(182(181. 

Jt J Si JSn^i y J 



In order to evaluate the multiple integrals above, we tranform the integration domain to the 
standard simplex A„ defined by ^ via the following mapping: 



J : 



nn , iron 



\snj 



( {T-t)8i + t \ 

{T-t){8l + 82) + t 

\{T-t)isi + --- + 8n)+tJ 



(34) 



Using (33), we rewrite (34) as follows: 



1 1 



P^{t,T) = l + ^(-l)"/i"^...^i^„(i„) 



n=0 



ii=0 in=0 



i=i 



(35) 



•(T-t)" / e<"'" dx, 

J An 
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where z^"'^ := — (T — t)(Aj^, • • • , Ai„)'^ G M" and (• , •) denotes the standard inner product. 
Applying ([s]) with a = 1, we express the integrals in (35) as iFi functions as follows: 



n! / e<^^"''^> dx= iFi(l;n + l; z^") 



(36) 



for ah G (n G No), setting iFi(l; 1; — i. if ^e combine (jsTj) and (jss]) with (jSGI) 
and the fact that vr,- - = 



, ij G {0, 1}, the theorem follows. □ 

Now we consider the £{1/2, A) model with a = /3 = 1. On the one hand, we lose one of 
the fitting parameters, although, the model is still well suited to model the term structure, 
and it yields the famous Vasicek model in the limit (see Section [5]). On the other hand, we 
obtain a more tractable pricing formula for ZCBs, where, in contrast to the general case, no 
multiple sums need calculation, which improves the computational speed. 

The calculation of the arbitrage-free ZCB price (26) in this setting is very intuitive and 
requires no knowledge of the transition probabilities of {Rt)t>o^ since the only stochastic 
parameters are the arrival times of the underlying Poisson process. 



4.2 Theorem (ZCB price in iS(l/2,A) model). Let {Rt)t>o be given by the definition (21) 
with A > and a = f3 = 1. The price at time t >0 of a ZCB with maturity at T is given by 



where k :- 



P{t,T) = e-(^™+^^)(^-*) . Pi{t,Tf ■ Po{t,T)^^^ 

^^^e{o,...,iV}, 

g(A(r-t))2" 

n=0 



Pi{t,T) 



(2n)! 



(2n+l) 



(37) 



(38) 



+ ^} ■ iFi(l;2n + 2;-z 



2n + l 



Poit,T) 



E 

n=0 



(A(r-t))^" 

(2n)! 

A(r-t 



{ 



iFi l;2n+ 



(2n) 



+ ^^e-'^^^-*). iFi(l;2n + 2;.(^"+i))}, 



where z^^r^) := h{T - t)(0, 1, . . . , 0, 1)^ G M^", := h{T - t)(l, 0, 1, . . . , 0, 1)^ G 

and iFi is defined by Uu. 



(39) 



p2n+l 



Proof. Let rk = hk + r^, A; G {0, . . . , N}, be the state of {Rt)t>o at time t. Analogue to the 

we obtain 

(40) 



derivation of the expresion (31) in the proof of Theorem |4.1[ we obtain 

P{t,T) = e-''™^^-*) • Pi{t,T)'' • Po(^,T)^-^ 
where 



Py{t,T) :=Ey,t 



exp 



/lYo ds 



ye {0,1}. 



(41) 



In order to evaluate Py{t, T), we count the number of jumps in the underlying Poisson process 
{Nt)t>o within the time interval {t,T], and, denoting the jump times by (Tj)jgj^ and setting 



To := t, we split the integral on the right-hand side of (41), obtaining 

XD / 1 



Py{t,T)=E 



y,t 



El 

.n.=0 



{Nt 



• exp 



E 

i=0 



HYj ds 



hYn ds 
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At this point, we have to distinguish between even and odd numbers of jumps, since (Yn)neN 
switches between and 1 P— a.s. according to its transition probabihty matrix ( |13| ). Thus, 
conditional on {Iq = 1}) the Markov chain (l^)neN stays in 1 after an even jump, whereas 
it stays in after an odd jump. This consideration yields 



Pi{t,T) = 



n=0 



/ n-1 

l{JVT_t=2n} • exp - ^ 



+ E 



hds — hds 

Q -J T2i J r2n J 

(" rT2i+i ^ 
/ hds 
i = -^^^i J 



ri=0 



E 



/ 2ri 

exp ( /i-^(-l)Vi + t-r 

/ 2n+l \ 



exp 



h- 5^(-l)V, + t 



i=l 



Nr-t = 2n 
Nr-t = 2n + 1 



• P {Nr-t = 2n) 

• P (TVt-* = 2n + 1) I. 



Furthermore, from the order statistics property of the Poisson process (see, for example, [9], 
pp. 101-102), we know that the joint density of the arrival times ri, . . . , (/c G N) of {Nt)t>o 
in {t,T], conditional on {Nr-t = k}, is given by 



< Ti < T2 < ■ ■ ■ < Tk <T\NT-t = k) 



I [ ■■■ [ dtk...dt2dh. (42) 



(T-t) 



Hence, for Pi{t,T) we obtain 



n=0 



-h{T-t) _ 



(2n)! 



(T - t)2« 
-X(T-t) {KT-t)f 



2n 



exp 



t2n~l 



h-Y,i-^yti dt2n...dtl 



i=l 



+ 



(2n)! 
(2n + l)! 

.,(^_,)(A(r-0)^ 

(2n + 1)! 



„T / 2n+l \ 

/ exp /i • ^ (-l)*ii dt2n+i ...dti 

Jt2n \ i = l / 



(43) 



Analogue to the proof of Theorem 4.1, we consider the mapping J given in ( 34 ) and integrate 



( 43 ) by substitution, which yields 



pf)(t,r) = ,-A(T-t) J2 ^^^^ ".^.^^ " e-^(^-*) • (2n)! / e<^*'"^'-) 

n=0 [ -^^2" 

+ (2n + l)! / e^~^''"^'''^Ux\, 

J A2n + 1 J 



(44) 



where — h{T - t)(0, 1, . . . , 0, 1)^ G M^", z(2n+i) — /^(r - t)(l, 0, 1, . . . , 0, 1)^ G M2"+i, 



and (• , •) denotes the standard inner product. Applying the relation (36), we rewrite (44) as 
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follows: 



-A(T- 



a^ i\{T-t)) 



E 

n=0 



2n 



(2n)! 



|e-MT-t). iFi(l;2n + l;z(2-) 



+ ^;|^-iFi(l;2n + 2;-z(2"+i: 
2n + 1 V 



In similar fashion, we obtain 



(45) 



ra=0 



(A(r-t)) 

(2n)! 



2n 



iFi fl;2n+l;-z(2") 



M^^e-'^^^-*) iFi f 1; 2n + 2; ^(^n+i)) \ 
2n + 1 V /J 



-A(T~t) 



Po(t,T). 



If we combine ( 45 ) and ( 46 ) with (|40p , the theorem fohows 



(46) 
□ 



4.3 Remark. We notice that Theorems 4.1 and 4.2 are based on two completely different 
approaches and yield different representations of the ZCB prices. However, both formulae 
involve the confluent hypergeometric function iFi defined by ([T]). 



Practical implementation From Theorems 4.1 and 4.2 the ZCB prices can be computed 
approximately by truncating the series in the according formulae. We also use the truncated 
iF]^ function defined by Ml) as an approximation for the iFi function. 



(47) 



Thus, in the setting of Theorem 4.1, we truncate the sum of series (28), obtaining 
P{t,T-M,H) 



Py{t,T-M,H) 



^' {-h{T-t)Y 



n=l 

E-E^^-. 

il=0 in=0 



n / \ li 
p 



n 



B. 



■ iFf (l;n + l;z 



(n) 



for y £ {0, 1}. 



In the setting of Theorem 4.2, we truncate the series ([38j) and (39), obtaining 



P{t,T; M,H) 
Pi{t,T; M,H) 



(48) 



M 

E 

n=0 



(A(T-t)) 



2n 



(2n)! 



. je^MT-t) . ^pH J^^. 2n + 1; h{T - t) 



.(2n) 



+ ^^L_J1 . ^Ff (l; 2n + 2; -h{T - t)z^^^+''' 



2n + 1 



}• 



Po{t,T;M,H) 



M 

E 

n=0 



(A(T-t) ) 
(2n)\ 



2n 



I iF^ (l; 2n + 1; -h{T - t)z 



(2n) 



+ A(^-^) ^-MT-t) . ^^1/ . 2^ + 2; /i(r - t)z(2«+i)) j. 
2n + 1 \ /J 
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The choice of the truncation parameters M and H is left to the practitioner and should 
be made in the way of maintaining a balance between the accuracy of the results and the 
computational speed. Some numerical examples, which provide numerical accuracy and 
computational speed for the formulae (47) and (48), will be given at the end of the next 
section. In the following we omit explicitly writing out the dependence on M and H when 
it is clear from the context. 



5 Connection to the Vasicek model 

The Vasicek model [20] is one of the most popular short-rate models. Closed-form expressions 
of ZCB prices and European options thereof make the model highly appealing to practitioners. 
However, it also has some shortcomings. 

In this section we give a short description of the Vasicek model. We point out its advan- 
tages and disadvantages. Here we follow paragraph 3.2.1 of [1]. We provide a convergence 
result, which shows that after a linear rescaling the £^(1/2, A) model converges weakly to the 
Vasicek model. We show the convergence of the respective ZCB prices and provide some 
numerical examples. 



Vasicek model The formulation of the Vasicek model under the risk-neutral measure P is 



dr{t) = k[e - r{t)]dt + adWu r(0) = tq, (49) 

where k, 9, a, ro are positive constants, and (Wt)t>o is the standard Wiener process on the 
probability space {Q, J-, {J-t)t>Oi^) with the natural filtration {J-t)t>o- Integration of the 
equation ( 49 ) yields for i > s 

r{t) = r(s)e-'=(*-^) + 6 (l - e-'^^*"^)) + a ^-^{t-u) ^^^^ ^^q) 
Thus, conditional on J^g (t > s), r{t) is normally distributed with mean and variance 

E[r{t)\Ts] = r(s)e-'=(*-^) + (l - e-'^^*-^)) , (51) 

Var[r(t)|J-,] = ^ (l - e-^'^^*"^)) . (52) 

Hence, the short-rate process {rt)t>o tends to the mean-reverting value for t — )• oo. The 
drawbacks of the model are the possible negativity of the interest rates, implied by the 
Gaussian distribution, and the fact that it is driven by only three parameters, which makes 
the calibration an ill-posed problem and yields often poor results. 

The price at time i > of a ZCB with maturity at T, conditional on r(t) = r, is given by 

p(t,r) = A(^,^)e-^(*'^)■^ (53) 

where 

A{t,T) = e^pl^(e-^yB{t,T)-T + t]-^B{t,Tf 
B{t,T) = ^ (1 - e-'^^^-*) 
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Convergence results It is well known that the Ehrenfest process converges weakly to 
the Ornstein-Uhlenbeck process (see, for instance, |9j, pp. 168-173, or [Uj). The following 
theorem shows that the i?(l/2. A) model also converges weakly to the Vasicek model. 

5.1 Theorem. Let {r{t))t>o be given as in (49). Consider {R[^^)t>o as defined in (21) with 



a = /3 G (0, 1], A := a/ {a + /3), rm ■= - ay ^ and rM ■= + ay ^. Then, 

{Ri^^)te[o,T] {r{t))te[o,T] as iV ^ oo, 
where " =^ " denotes the weak convergence. 

Proof. The proof follows analogue to |9j, pp. 168-173, when we compute the conditional 
moments of ARt := Rt+At — Rt- CH 

5.2 Remark. Karlin and McGregor show in a rigorous way (see |10j . pp. 371-373), that 



the transition probability function (17) of the Ehrenfest process converges locally uniformly 
to the transition probability function of the Ornstein-Uhlenbeck process as iV — )• oo, which 
sharpens the above result after a linear transformation of the underlying processes. 



A direct consequence of Theorem 5.1 is the convergence of the respective ZCB prices. 



5.3 Corollary. Consider (^"(t))fG[o.T] and (i?[^'*)tg[o^r] as in Theorem 5.1 and let P{t,T) 



and P^^\t,T) denote the associated ZCB prices at time t with maturity at T given by (53) 
and ( f3?| ) respectively. Then, 

p(^){t,T)''-^ P{t,T). 

Proof. W.l.o.g. let t = 0. We denote by D := D[0, T] the space of the real- valued functions on 
[0,T] that are right continuous and have left-hand limits (RCLL). From [T] (see p. 123), we 
know that a metric exists that makes D a Polish space, i.e. a metric, separable and co mple te 



5.1 



it 



space. Clearly, R]\j := (-R|^'')tg[o,T] aiid ^ '■= (^(*))ie[o.T] both lie in D. With Theorem 
follows that Rn ^ r iuD. 

Consider a linear operator 5" on D, defined by 

{Sf){t) := [ /(s) ds for / G D and t G [0,r]. 
Jo 

Clearly, 5" is a continuous operator on D. Then, the operator S, defined by 

{Sf){t) := exp (^-{SRN){t)^ = exp f{s) ds^ for / G D and t G [0,r], 

is a continuous operator on D. Let ■= {SR^^\t))f^^Q^x] and Y := (5r(t))ig[o^r] . Then, 
Theorem 5.1 in yields liv =^ Y. Since Y^ is uniformly integrable, it follows from Theorem 
5.4 in [U that 

W.[Yn]^^^[Y], 

which completes the proof. □ 
Figure [l] illustrates the convergence result of Corollary 5.3 All computations were made 



on an INTEL Core2Duo 2400MIIz machine. We consider two scenarios: in case (a) we have a 
favourable set of parameters for the ZCB valuation; in case (6) we choose an unrealistic high 
of 20% for the interest rate market volatility and a time to maturity of 10 years. Within the 
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Vasicek model the valuation is done according to the pricing formula (53). The approximative 



values P{0,T; M, H) of the ZCB prices in the £^(1/2, A) model are computed according to 
Corallary 5.3 via (48) with M = 10 and H = 30. In both cases we observe fast convergence 



of the respective prices. 



200 300 
N 

(a) Favourable case 



200 300 
N 

(b) Unfavourable case 



Figure 1: Relative price errors against N approximating ZCB prices P(0,T) in the 
Vasicek model by p(^)(0, T; 10, 30) in the £{1/2, X) model via (48) in two scenarios: 
(a) Vasicek model with a = 0.2, a = 0.05, T = 1 year, 9 = 8%, and ro = 5%; 
(6) Vasicek model with a = 0.2, a = 0.2, T = 10 years, 9 = 8%, and ro = 5%. 



Furthermore, we see that the choice of the truncating parameters M = 10 and H = 30 
is satisfactory for our purpose. The computation time of one ZCB price via (48) is less than 
0.1 seconds. Similar computation in the £{p, A) model according to (47) takes 2.67 seconds. 



6 Discussion 



In this section we discuss the advantages of the £{p, A) model with respect to the positivity 
of the interest rates. We use the case study of a ZCB valuation, showing that the £{p, A) 
model can still be used when the Vasicek model reaches its limits. 

The main shortcoming of all models with Gaussian distribution, including the Vasicek 
model, is the positive probability of the interest rates becoming negative. Although this 
probability is rather small, some problems may appear while valuing ZCBs with long residual 
maturity. For instance, Rogers [15] illustrates how an attempt to keep the probability of 
negative interest rates negligible by choosing suitable parameters of the Vasicek model in the 
limiting case t — )• oo leads to an exponential growth in t of the ZCB prices. Conversely, the 
£{p, A) model allows the choice of the lower and upper bounds and rpi for the interest 
rate, and excludes the possibility of negative as well as unrealistically high positive interest 
rates. 

Times of financial crisis are often accompanied by interest rates near 0%, as we see 
at present. The following example of pricing ZCBs in a respective scenario illustrates the 
advantage of the £{p, A) model over the Vasicek model. First, we assume the Vasicek model 
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given according to (49) with = 4%, a = 0.05, k = 0.1 and r(0) = 1%. Figure [2] (a) shows 
three sample paths of the underlying process {rt)t>Q over a period of 30 years simulated 



according to ( 50 ) . We see that every path of the simulated process spends some time below 



the zero mark. Figure [2] (6) demonstrates the weakness of the model in the case at hand, as 
we observe that the ZCB prices are not monotone falling in the time to maturity and even 
exceed the upper bound of 1 monetary unit, which is contradictory to no-arbitrage principles. 




15 20 
Time (years) 

(a) Short-rate 



10 15 20 

Time to maturity (years) 

(b) ZCB prices 



Figure 2: (a) Three sample paths of the short-rate process (49) in the Vasicek model with 
a = 0.1, e = 4%, a = 0.05 and r(0) = 1%. (6) ZCB prices in the Vasicek model with the 
given parameters and residual maturities from 1 to 30 years. 



Now we consider the £{p,\) model in a similar hypothetical setting. We set the lower 
and upper bounds at = 0% and tm = 16%, and the state space discretization parameter 
N = 160. We choose A = 1, a = 0.1 and /3 = 0.3, in that we have with (24) a mean-reverting 
value of 4% as in the case above. Here, we set i?o = 1% as well. Figure 3[(a) demonstrates a 
possible trajectory of the short-rate process {Rt)t>o over 30 years, simulated on the basis of 
the underlying distribution. In Figure [s] (5) we see the strictly monotone decreasing character 
of the respective ZCB prices as a function of the time to maturity, which is highly plausible. 



7 Conclusions 

This paper has explored a finite-state mean-reverting short-rate model based on the Ehrenfest 
process. The respective short-rate process can be seen as an affine linearly transformed birth- 
and-death process on {0, 1, • • • , N}, N £ N. The model provides a certain degree of analytical 
tractability, since it allows explicit pricing of ZCBs and solves the problem of negative interest 
rates characteristic of Gaussian models. The pricing formulae for ZCBs have been derived for 
both the general case and the special case, in which the underlying distribution is symmetric 
with respect to the mean-reverting value. The key to both approaches has turned out to 
be the representation of the underlying Ehrenfest process as a sum of independent binary 
processes, which has been possible only in continuous time. We also used the hypergeometric 
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10 15 2' 

Time (years) 

(a) Short-rate 



(b) ZCB prices 



Figure 3: (a) A sample path of the short-rate process (21) in the £{p,X) model with a = 
0.1, /3 = 0.3, A = 1, = 0%, tm = 16%, N = 160 and Ro = 1%. (b) ZCB prices in the 
£{p, A) model with the given parameters and residual maturities from 1 to 30 years. 



functions of a matrix argument and the Krawtchouk polynomials. The special case benefits 
also from a more tractable pricing formula for ZCBs. 

We have seen that the Ehrenfest short-rate model is a good approximation to the Vasicek 
model under normal conditions and a better alternative to it in extreme cases, where the 
interest rates are low and the volatility is high, providing solely positive interest rates. A 
further advantage of the model is the availability of five fitting parameters in the general 
case. 

Our conclusion is that especially the general case of the Ehrenfest short-rate model is an 
interesting enrichment in the field of term structure modelling, combining analytical tractabil- 
ity with the desired property of interest rates remaining positive. 

Problems that remain open for the short-rate model that we have examined here are the 
derivation of an explicit pricing formula for European options on ZCBs, parameter estimates 
for the model under the objective measure, and an extension of the model according to the 
three urn Ehrenfest model (see. [10], pp. 363 - 368). 
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